CS 730 R : Topics in Data and Information Management – Big Data Analytics

نویسندگان

  • Steven Euijong Whang
  • Hector Garcia-Molina
چکیده

The paper presents two concepts: entity resolution (ER, record linkage) and data privacy (DP). Authors presented a sketch of a framework for managing information leakage, and studied how the framework can be used to answer a variety of questions related to ER and DP. In the paper they studied the problems of measuring the incremental leakage of critical information. The framework bases on definitions and usage of two functions – match and merge. The former function allows to detect attribute values, which describe the same entity, while the latter function merges such values into one record describing such entity. Calling these functions subsequently incrementally builds a set of data that are disclosed about described entity. Authors used disinformation as a mechanism to minimize information leakage. The paper presents a model of the problem, shows an idea of the framework, explains motivation of authors, and provides plenty of examples, but for any details refers to the technical report.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Big Data Analytics and Now-casting: A Comprehensive Model for Eventuality of Forecasting and Predictive Policies of Policy-making Institutions

The ability of now-casting and eventuality is the most crucial and vital achievement of big data analytics in the area of policy-making. To recognize the trends and to render a real image of the current condition and alarming immediate indicators, the significance and the specific positions of big data in policy-making are undeniable. Moreover, the requirement for policy-making institutions to ...

متن کامل

Application of Big Data Analytics in Power Distribution Network

Smart grid enhances optimization in generation, distribution and consumption of the electricity by integrating information and communication technologies into the grid. Today, utilities are moving towards smart grid applications, most common one being deployment of smart meters in advanced metering infrastructure, and the first technical challenge they face is the huge volume of data generated ...

متن کامل

CS 730 R : Topics in Data and Information Management – Big Data Analytics

In this paper authors introduced differential computation, which is a generalization of current techniques of incremental computations. They also introduced a definition of differential data, which shows practical application of differential computation in parallel settings. The motivation presented by authors emphasize performance of the new approach, which is very high especially for dynamica...

متن کامل

P-V-L Deep: A Big Data Analytics Solution for Now-casting in Monetary Policy

The development of new technologies has confronted the entire domain of science and industry with issues of big data's scalability as well as its integration with the purpose of forecasting analytics in its life cycle. In predictive analytics, the forecast of near-future and recent past - or in other words, the now-casting - is the continuous study of real-time events and constantly updated whe...

متن کامل

Big Data Quality: From Content to Context

Over the last 20 years, and particularly with the advent of Big Data and analytics, the research area around Data and Information Quality (DIQ) is still a fast growing research area. There are many views and streams in DIQ research, generally aiming at improving the effectiveness of decision making in organizations. Although there are a lot of researches aimed at clarifying the role of BIG data...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013